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Abstract 

The space-time bit-interleaved coded modulation (ST-BICM) is an efficient technique 
to obtain high diversity and coding gain on a block-fading MIMO channel. Its maximum- 
likelihood (ML) performance is computed under ideal interleaving conditions, which enables 
a global optimization taking into account channel coding. Thanks to a diversity upperbound 
derived from the Singleton bound, an appropriate choice of the time dimension of the space- 
time coding is possible, which maximizes diversity while minimizing complexity. Based on 
the analysis, an optimized interleaver and a set of linear precoders, called dispersive nucleo 
algebraic (DNA) precoders are proposed. The proposed precoders have good performance 
with respect to the state of the art and exist for any number of transmit antennas and any 
time dimension. With turbo codes, they exhibit a frame error rate which does not increase 
with frame length. 

Index terms 

Multiple antenna channels, bit-interleaved coded modulation, space-time coding, 
Singleton bound, interleaving 
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1 Introduction 

The wide panel of today's wireless transmission contexts makes implausible the existence of a 
miraculous universal solution which always exhibits good performance with low complexity. Dif- 
ferent scenarios (indoor, outdoor with low velocity, outdoor with high velocity) correspond to 
different amounts of time and frequency diversity. The success of the multi-carrier modulation as 
a solution for future wireless systems is in part due to the low receiver complexity even over large 
frequency bands. In this paper, we focus on an indoor environment and design a system approach- 
ing the optimal performance taught by information theory. In a wireless indoor environment, both 
time and frequency diversities may be poor due to small terminal velocity and possibly very short 
channel impulse response. These particularly tricky low-diversity channels are modelled as block- 
fading channels. Over low-diversity multiple-input multiple-output (MIMO) channels, space-time 
coding techniques often enable transmission with improved data rate and diversity, within a limit 
given by the rank of the MIMO system pQ [TH] [IH] [H] . These open- loop schemes only require the 
knowledge of the channel long-term statistics. Besides, closed-loop techniques such as beamform- 
ing take benefit from a short-term channel knowledge to improve the performance/complexity 
trade-off at the cost of additional signalling overhead. As a first step in providing increased data 
rates in future generations of indoor wireless local access networks (WLANs), we study how to 
appropriately choose the channel coding, the channel interleaving and the space-time coding. 

For frame sizes of practical interest, coded modulations have to be considered since space-time 
codes employed with uncoded modulations exhibit a frame error rate (FER), which is dramati- 
cally degraded [2H Annex A]. Thus, we focus on the bit-interleaved coded modulation (BICM) 
structure, which is the concatenation of a channel encoder, an interleaver and a modulator. The 
analysis of the BICM maximum-likelihood (ML) performance is tractable and eases the coded 
modulation design. Furthermore, thanks to the interleaver, iterative processing at the receiver 
achieves quasi-ML performance with reduced complexity. On a MIMO channel, the BICM may 
be concatenated with a simple full-rate space division multiplexing scheme (SDM) [21 . In this 
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paper, we improve performance of this space-time BICM (ST-BICM) by replacing the SDM by 
a more efficient full-rate linear space-time code: a linear precoding or equivalently a space-time 
spreading. Linear precoding is performed by multiplying the complex multiple-antenna signal by 
a square complex space-time matrix. The space-time matrix enhances the diversity by mixing the 
symbols of different time periods and antennas together. 

The choice of the ST-BICM structure may also be explained as follows: We aim at optimizing 
a full-rate space-time code based on linear precoding, taking into account the structure of the 
whole transmitter, which inevitably includes an error correction code, an interleaver and a symbol 
mapper. Usually, space-time codes are designed independently from the other elements of the 
transmitter. However, frames of bits are linked through the error correction code and optimizing 
the space-time code taking into account the whole transmitter is equivalent to optimizing a BICM 
concatenated with a space-time code, i.e., an ST-BICM. 

On an ergodic channel, the achieved diversity order is equal to the code minimum distance 
multiplied by the number of receive antennas. In most cases, the minimum distance is high 
enough and increasing diversity through linear precoding does not bring much improvement. For 
a block-fading channel, the diversity is upperbounded by the number of channel realizations in 
a codeword multiplied by the number of transmit antennas and the number of receive antennas. 
Using the Singleton bound, we will exhibit an additional upperbound on the diversity order, which 
may be very limiting without precoding. Hence, in this paper, we will study ST-BICM with lin- 
ear precoding, focusing on the block-fading channel and optimize the linear precoding using the 
ST-BICM ML performance in order to achieve full-diversity and maximum coding gain. First, 
we derive the coding gain of an ideal ST-BICM. It is related to the notion of Shannon code and 
sphere-hardening p|Sj . Indeed, the ideal Shannon code for additive white gaussian noise ( AWGN) 
channels is located near a sphere, called the Shannon's sphere. Thanks to the interleaver, the 
squared Euclidean norm of BICM codewords has low variance, which implies that codewords lie 
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close to the Shannon's sphere. The BICM may be seen as a quantization of this sphere, which 
should be as uniform as possible to maximise the size of Voronoi regions. On MIMO fading chan- 
nels, the Shannon's sphere becomes a Shannon's ellipsoid [20 and BICM codewords are randomly 
located close to the ellipsoid. We show that the ideal BICM configuration maximizes the Voronoi 
region volume whatever the channel realization. We present a practical system that approaches 
the ideal BICM configuration including the so-called dispersive nucleo algebraic (DNA) precoder 
and compare its performance to the ideal ST-BICM performance. The DNA precoder exists for 
any numbers of space and time dimensions. We finally design a practical interleaver, which ap- 
proximates the ideal interleaving conditions. 

The paper is organized as follows: In section the ST-BICM transmitter and the associated 
iterative receiver are presented. In section O we derive the analytical ML performance under ideal 
interleaving assumption for an ergodic channel without precoding and a block-fading channel with 
and without precoding. Using the Singleton bound, we show in section H that ideal interleaving 
conditions cannot be achieved on a block-fading channel with any kind of parameters and that 
linear precoding may be mandatory in some configurations. Section[5]describes the linear precoding 
optimization for a block-fading channel and section[H]thc interleaver design for convolutional codes 
and its application to turbo codes. Finally, simulation results are presented in section [Jj which 
confirm the behavior which was expected from the analytical study. Furthermore, they show 
the good performance of the DNA precoders and the advantage of using turbo codes to get a 
non-increasing FER when the frame size increases. 

2 System model and notations 

2.1 Transmitter scheme 

The transmitter scheme is built from the following fundamental block concatenation: A binary 
error-correcting code C followed by a deterministic interleaver II, a symbol mapper (e.g., for a 
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quadrature amplitude modulation (QAM)), a full-rate space-time spreader S (i.e., a linear pre- 
coder) and a set of n< transmit antennas. Fig. [1] illustrates the BICM transmitter structure. 

Without loss of generality, we assume that the error correcting code C is a convolutional code 
with rate Re- The encoder associates with the input information word b the codeword c G C. 
Sequence b (resp. c) has length KqLc (resp. NqLq) bits, where Lq is the codeword length in 
trellis branches. The intcrlcavcr II, which scrambles the LqNq coded bits, is a crucial function in 
the BICM structure, as it allows the receiver to perform iterative joint detection and decoding. 
Indeed, it ensures independence between extrinsic and a priori probabilities, in both the detector 
and the decoder. Furthermore, when maximum-likelihood (ML) decoding is tractable, interleaving 
prevents erroneous bits of a same error event from interfering to each other in the same precoded 
symbol. The interleaver II may be pseudo-random (PR) or semi-deterministic with some deter- 
ministic constraints as described in section [5] In the symbol mapper, m consecutive interleaved 
coded bits are mapped together onto a modulation symbol, according to a bijection between bit 
vectors and modulation symbols called mapping or labeling. The number of modulation symbols is 
equal to M — 2 m . For each channel use, i.e., in each time period, the mapper reads mn t coded bits 
and generates n t modulation symbols. To make the reading easier, the obtained nt-dimensional 
constellation will denote both the set of symbols and the set of binary labelings. All along this 
paper, we will consider QAM modulations as they achieve a good compromise between spectral 
efficiency (in bits/s/Hz or bits/dim) and performance. Moreover, with QAM modulation, the sys- 
tem is easily modeled using a lattice constellation structure [TU], which gives access to the lattice 
theory toolbox, both for transmitter and receiver optimizations. We assume that the QAM mod- 
ulation has unit energy. The linear precoder S spreads the QAM symbols over s time periods. It 
converts the n t x n r vector channel into an Nt x N r vector channel, where Nt = n*s and N r = n r s. 
The Nt x N t matrix S multiplies a vector of Nt QAM symbols = (zi t i, 2:^2, • • • 1 z i,N t ) at the 
mapper output, generating Nt symbols to be transmitted during s time periods. Vector Zj is the 
j th vector to be precoded. The precoder S spreads the transmitted symbols over a higher number 
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of channel states to exploit diversity. S is normalized as follows: 

N t N t 

U— 1 V— 1 

In this paper, we assume a block-fading channel with n c distinct channel realizations during a 
codeword. We denote n s the number of distinct channel realizations during a precoded symbol. 
To simplify notations, we assume that n s divides n c . We will call channel state the 1 x n r SIMO 
channel associated with one of the n t transmit antennas and one of the n c channel realizations. 
The channel experienced by precoded symbol i is represented by a Nt x N r block-diagonal matrix 
Hi with s blocks of size n t x n r . During one precoded symbol, we assume that each of the n s 
channel realizations is repeated s/n s times. The Hi matrix is organized as follows: 



where HQ 1 denotes the n t x n r complex matrix representing the i-th channel realization experienced 
by the i-th precoded symbol. H-*' is repeated s/n s times. Elements of H-*' arc independent 
complex Gaussian variables with zero mean and unit variance. Let H denote the set of channel 
realizations observed during the transmission of a codeword. Thanks to the extended channel 
matrix, we write the channel input-output relation as: 



where y$ e and each receive antenna is perturbed by an additive white complex Gaussian noise 
rjij, j = l...N r , with zero mean and variance 2No. We define the signal-to-noise ratio E^/Nq, 
where Ef, is the total energy of an information bit at the receiver. Thanks to linear precoding, 
the n t x n r MIMO n c -block-fading channel is converted into an N t x N r MIMO ./V c -block-fading 
channel where iV c = n c /n s . If n s = 1, the precoder experiences a quasi-static n t x n r MIMO 
channel. In the following, index i will be omitted if a single precoded symbol is considered and 



H i = diag(H| 1] ,... ) 



M i H i >■■■.«; ■■■,t±i ) 



(2) 



= Xj + r] t = ZjSHi + rji 



(3) 
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precoding time period will refer to a transmission over SH, i.e., over s time periods. 



The concatenation of the binary error correcting code C, the interleaver II, the mapper f2, the 
linear precoder S and the channel describes a global Euclidean code Ce which converts LqKq 
information bits into a complex LeAc/m-dimensional point. 

2.2 Iterative receiver scheme 

An ideal BICM receiver would directly perform an ML decoding on the set Ce of transmitted 
codewords. However, it requires an exhaustive search among the 2 KcLc codewords, which is 
intractable. All existing receivers use the concatenated structure of the BICM to split the reception 
into several steps. In this paper, we assume perfect synchronization and channel estimation. Thus, 
the receiver, as depicted on Fig. [2j is divided in two main elements: a soft-input soft-output (SISO) 
APP QAM detector, which acts as a soft-output equalizer for both the space-time spreader and 
the MIMO channel, converting the received point y into information on the coded bits in the 
estimated coded sequence c, and a SISO decoder for C, improving the information on coded bits 
and estimating the information bit sequence b. The depicted iterative joint detection and decoding 
process is based on the exchange of soft values between these two elements. The SISO detector 
computes extrinsic probabilities on coded bits thanks to the conditional likelihoods p(yi/z) 
and the a priori probabilities ir(ce) fed back from the SISO decoder: 



where f2 is the Cartesian product (M-QAM)^*, i.e., the set of all vectors z generated by the QAM 
mapper, |0| = 2 mNt . The subset Sl(ct = 1), for £ = 0,1, ... ,mN t — 1, is restricted to the vectors 
z in which the £-th coded bit is equal to 1. The detector independently computes the soft outputs 
for each precoding time period. At the first iteration, no a priori information is available at the 
detector input. Through the iterations, the a priori probability on constellation points computed 




(4) 
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from the probabilities fed back by the SISO decoder becomes more and more accurate. Ideal 
convergence is achieved when a priori probabilities provided by the decoder are perfect, i.e., equal 
to or 1. The decoder uses a forward-backward algorithm [2 ], which computes the exact extrinsic 
probability using the trellis structure of the code. 

3 Theoretical performance for ideally interleaved BICM 

Heavy work has been made to estimate the frame or bit error rate of the BICM with ML decoding, 
in particular using Gaussian approximations or numerical integrations [S], but a closed- form ex- 
pression of the pairwise error probability had not been derived yet. This section first describes an 
accurate computation of bit and frame error rates of BICM ML performance over ergodic MIMO 
channel with ideal interleaving and without precoding. A more detailed description of the deriva- 
tion may be found in [23] and |24j . Under ideal interleaving condition, we are able to derive a 
closed form expression of the probability density of the log likelihood ratio (LLR) at the output 
of the detector and then a closed form expression of the pairwise error probability at the output 
of the decoder. It is then straightforward to use well-known techniques to estimate the bit or 
frame error rate of a coded modulation from pairwise error probability. This subject has been 
extensively discussed for coded modulations over AWGN channels. Examples are the union bound 
on the transfer function of a convolutional code and the more accurate tangential sphere bound 
[SI)] f° r spherical constellations. 

Subsequently, we extend the study to the block-fading MIMO channel when linear precoding 
is used at the transmitter. Note that the method is also valid for correlated MIMO channels. We 
extract from the bit error rate expression some design criteria on BICM precoder, interleaver, and 
error correcting code. 

3.1 Ideal interleaving condition 

The evaluation of the bit error rate (BER) or frame error rate (FER) of a coded modulation 
is usually based on the derivation of an upper bound on the actual performance obtained by a 
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balanced summation of pairwise error probabilities. Each pairwise error probability involves the 
Euclidean distance between two codewords with a Hamming distance w. 

With an n t x n r MIMO block-fading channel with n c blocks, the minimum diversity recovered 
at the detector output, and thus at the decoder output, is always equal to the reception diversity 
n r . Let us consider an error event with w erroneous bits. Assume that the maximum diversity 
order is T max . If w > T max /n r , we achieve full diversity if each of the T max /n r independent 
fading random variables is experienced by at least one bit among w. In a precoding time period 
k in which at least an erroneous bit is transmitted, the transmitted and competing points are 
called Xfc = ZfeSHj, and x' fc = z'^SHfe. When performing ML decoding or APP detection, we 
are interested in the equivalent Binary Shift Keying (BSK) modulation defined by the two points 
Xfc and ~x! k . The vector (zfc — zJ,)SHi has sn r circular symmetric Gaussian components. Thus, 
whatever the number of erroneous bits on a precoding time period, the obtained diversity is limited 
to sn r . Having several erroneous bits per precoding time period is useless. On the contrary, if 
the erroneous bits are located on different precoding time periods and experience different fading 
random variables, a higher diversity is achieved. This is what we call the non-interference property. 
Furthermore, we will see in section [5] that an equi-distribution of erroneous bits on channel states 
is required to achieve a maximum coding gain. We call it the equi-distribution property. The ideal 
interleaver is defined as follows: 

Definition 1 (Ideal Interleaving) For any pair of codewords with w different bits at positions 
ii, . . . , ifc, . . . , i W) an ideal interleaver allocates the bits to transmitted symbols as follows: 

• Non-interference property: Vifc,ifc/, bits at positions ik and ik> are transmitted on different 
precoding time periods, 

• Equi-distribution property: the bits at positions i±, . . . , ik, ■ ■ ■ , i w are as equiprobably dis- 
tributed over all channel states as allowed by w. 

In practice, such an interleaver does not always exist. We will see in the following that the 
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Singleton bound gives an existence condition of the ideal interleaver. In section we present 
optimized interleavers that approach the ideal condition. 

3.2 Exact pairwise error probability for ergodic channels without pre- 
coding 

In [23) , we have established a closed form expression for the conditional pairwise error probability 
on ergodic MIMO channels under ML decoding of the BICM and ideal channel interleaving. The 
mathematical derivation in this subsection follows |23j . Transmitted symbols are not precoded: 
s = 1, S = I„ t the n t x n t identity matrix. Thus, ^ reduces to = H^. Consider the 
pairwise error probability that a codeword c € C is transmitted and a codeword c' € C is decoded. 
The w different bits between the two codewords are transmitted in w different time periods, 
complementing one bit in the mapping of one of the n t QAM symbols. The transmitted noiseless 
vectors corresponding to the two codewords (c, c') only differ in w positions. Let us define Z = 
(zi, . . . ,z w ) and Z' = (z' l7 . . . , z' w ) the wnt-dimensional vectors corresponding to these positions 
and X = (xi, . . . , x^) and X' = (x' 1; . . . , x^) the um r -dimensional vectors corresponding to Z and 
Z' and filtered by the channel matrix Hj.. 

We define dk = ||zfc— z' k \\. The Euclidean distance ||X— X'| depends on both the set of distances 
{di, . . . , dk, ■ ■ ■ , d w } and the set of channel realizations Ti. Let D denote the set of all Euclidean 
distances obtained by flipping one bit in the constellation fL Define the set A = {Si, . . . , 5 nd ] C D 
with distinct elements from the sequence (di, cfe, • • • , d w ) € A w C D w , i.e., the Euclidean distance 
dk takes its values from the set A. Obviously, rid — |A| < \D\. Let the integer denote the 
frequency of dk in the sequence (di, d2, ■ ■ ■ , d w ), J2Z=i^ n ~ w ana * ^ = • • • > ^n d }- The 
pairwise error probability conditioned on the channel realization set Ti. and the Hamming weight 
w is expressed as 

P w m{c -> c') = P w ,nOi -f X') = P LLR k < 0^ (5) 
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where LLR^ is the fc-th LLR, corresponding to the fc-th error position, and is equal to 



Rk = ||(zfc — z' fc )Hfc|| 2 has a chi-square distribution of order 2n r . Averaging over Tt, we calculate 
the characteristic function of E-h EfcLi LLR)~\: 



^)=(n(4)" Jl n ^+^-^1 <-■> 

\fc=l V U/ / \n=-n d ,n^Q 



where 



n>0, p n = \ (l + yjl + 
n<0, /3„ = I f l - /l ^ Ma' 



(8) 

Applying a partial fraction expansion, we obtain the expression of the pairwise error probability: 



r,. r \ n r n d n r X r , 
ZI\ 



p. ( x^xo=p.(A,A)=n(-^) eetttt^v (9) 



k=l x k y n=l i=l 



where the coefficients a n ^ are given by an identification of the coefficients of two series expansions 
in e as in [23] . 

We compute the asymptotic expression when the noise level is low. Indeed, the coding gain 
and diversity are measured for high signal-to-noise ratios, where the performance has a linear 
asymptote on logarithmic scales. 

P(AA) ~ (2n r w-l\^(2No\^ (2n r w-l\( 2N \™- 

with (?) = n\/ (k\{n — k)\). The diversity associated with the considered pairs of Hamming weight 
w is the exponent of 2Ao, equal to wn r . We define the coding gain or coding advantage as the 
coefficient dividing 2Ao, i.e., 

/ w \ l / w 

g ergo (A,A)= (j[ d l) (ii) 
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All sequences (d\, . . . ,d w ) corresponding to the same pair (A, A) yield the same pairwise error 
probability. By averaging over all possible pairs (c, c') or equivalently over all sets of distances 
D w , we obtain P w — Ed™ [P w (A,A)], the conditional probability that an error event of Hamming 
weight w occurs. From this pairwise error probability, it is easy to estimate the FER or BER of 
the BICM with ideal interleaving thanks to a classical union bound on the weight enumeration 
function of the error correcting code. Moreover, we may derive a design criterion of the BICM from 
the coding gain Q ergo (A, A) expression. In the following, we derive the coding gain for block-fading 
channels and linear precoding in order to obtain the ML design criterion of the ST-BICM. 

3.3 Exact pairwise error probability for MIMO block fading channels 
without precoding 

We assume that Definition 1 is satisfied. For a block-fading channel with n c independent realiza- 
tions in a frame, the decision variable between X and X' is still given by ([5|). However, the involved 
channel matrices are not independent as for an ergodic channel. The conditions of independence 
are the following: 

• If two LLR random variables depend on two different channel realizations, they are indepen- 
dent. 

• If two LLR random variables depend on the same channel realization but on different transmit 
antennas, the random variables are independent. 

The maximum number of independent LLR variables is n c nt, the transmit diversity order. 
We choose the error correcting code so that w > n t n c . We now group the w random variables 
LLR into min(n t n c ,w) = n t n c independent blocks. Let LLR^^i^ be the z-th log-likelihood ratio 
corresponding to the BSK transmission on the l-th. antenna of the k-th block, k = 1 . . . n c , I = 
1 . . . n t and i = 1 . . . Kk,i, where Kk.i is the number of bits transmitted on the l-th antenna of 
the fc-th block. We have 53fe=i E"=i K k.i = w. Finally, LLR is the sum of the ntn c independent 
random variables LLR)~ i — Y)j—i LLRk.i.i- Let dk,i,i denote the distance associated with LLRk.i,i, 
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and define 7^ ; = J2i='i d\ ; i the distance associated with LLRk^. We have 

where R^j = 7^ ; ||Hfc(Z)|| 2 and Hfc(i) is the Z-th row of For all i, LLRk,i,i are transmitted 
over the equivalent 1 x n r SIMO channel defined by Hfc(Z), which is chi-square distributed with 
degree 2n r . The LLRk i variables are transmitted on independent channel states, as for the ergodic 
channel case, we directly apply © and obtain the conditional pairwise error probability closed- 
form expression 



Tic n t / \ Ur n d n r X n 



k=l 1=1 



7 fe j 



where 5 n £ A and (A, A) is the pair of sets representing the sequence (71,1, . . . , 7n t ,n c )- The a r , 
coefficients are computed as for ©. 

The asymptotic expression of P W (A, A) is 



The diversity associated with the considered pairs of Hamming weight w is then equal to the 
exponent n t n c n r . The coding gain is given by the geometrical mean of the 7^ ; and is equal to 

g bf (A,A)= nnE<M (i5) 

\fe=i;=ii=i / 

We will see in the following how to use this coding gain as a design criterion for the ST-BICM op- 
timization. We now consider an equivalent computation of the coding gain for a linearly precoded 
ST-BICM. 



13 



3.4 Exact pairwise error probability for MIMO block fading channels 
with precoding 

When a linear precoder S of size N t x N t is used, the detector computes soft outputs on the N t 
transmitted symbols using the equivalent channel matrix SHj, of size Nt X N r . The structure of 
Hfc is described in J2]). SH^ can be seen as a correlated MIMO channel L 43J . Under the ideal 
interleaving condition, we consider at most a single erroneous bit per block of s time periods in 
position 1 < I < mN t inside the binary mapping of the transmitted symbol z, leading to symbol 
z e . For simplicity reasons, we assume that the error weight w satisfies w > N t N c . Moreover, 
we assume that the mapping is mono-dimensional: the BSKs are transmitted on a single selected 
input of the matrix SHj,. Let LLRk,i,% be the z-th variable among Kk,i, corresponding to the 
transmission of a BSK on the equivalent 1 x iV r channel S/H/j, where S; corresponds to the l-th 
row of S. We have J^fcii K M = w - Let dkj.i denote the BSK distance associated with 

LLRk,i,i- We can use the factorization LLR k j — Yli=i LLRk.i^ of all the LLR variables filtered 
with SjHjt: 



where R k j = \\ V k jTI k \\ 2 , ~V k j = -f k jSi and 7^ = J2i=i d l,i,i- Tne variable R k j is a generalized 
chi-square random variable with 2N r correlated centered Gaussian components. The random 
variable LLR k = J^/Si LLRk,i satisfies 




(16) 




(17) 



From appendix [XJ we get the following characteristic function: 



En k =11 ft l 1 



2N Q k >' 



U 



) 



(18) 
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where fl^u * s the u ~th eigenvalue of 



N t 



N t s/f 



/ , lk,l "J "i 

{=1 i=l 



(19) 



is as an nt x nt Hermitian square root matrix of xtf and row vectors s| of size n t and 
s/n s x n t matrices S'E are denned from S as follows: 



Sj 1] (lx N t /n s ) 



Si 



[i][i] 



si 



[l][s/n s 



j[l][l] 



c[l][s/r 



si 



■[«s][l] 



eKffl 



Si 



n s ] [s/n s 



[n s ][s/n s 



iVf/s = n t coefficients 



(20) 



and 



Mm 



jWp] 



(21) 



s 



[i][«/r. 



The set of eigenvalues #[*■' is a function of the precoding matrix S and the BSK distances set 
D w . Thanks to the independence of channel realizations for different k values, we can multiply 
the characteristic functions: 



N c n s nt 

*ov)=nnn( i 

k=l t=l u=l 



V V -") #[*] 

2N k<t 



(22) 



Denote A = {S v } the set of ns square-roots of non-null eigenvalues extracted from the sequence 
defined by the values. Each eigenvalue d% is repeated A„ times. Observe that ns < n c n t . 
Finally using the partial fraction expansion of ^(jv) as for ©, we obtain the exact pairwise error 
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probability P W (A,A) conditioned on dff(c, d) 



= w: 



( 



2No 
S 2 



) 



A„n r ns n r \ v 



P W (A,A) = U 



EE 



(23) 



v=l 



v—1 i—1 



The asymptotic expression of P W (A, A) is 



(2n r N s - A fr /2jV \ 



X v n r 



P W {A,A) 



N ^0 



(24) 



where N$ = YlvLi is the total number of non-null eigenvalues. 

The diversity associated with the considered pairs of Hamming weight w is the exponent equal 
to J2vLi ^viT-r — Ngn r . The coding gain is given by 



We have derived for any signal-to-noise ratio an exact expression of the pairwise error proba- 
bilities of a BICM with linear precoding, which is useful for a tight BER and FER estimation. The 
asymptotic expression leads to the well-known rank and determinant criteria [40j |19j for space-time 
code optimization over MIMO block-fading channels, where the considered space-time code is the 
whole BICM structure. As a remark, the asymptotic design criterion is usually derived by first 
upperbounding the Q{x) function by exp(— x 2 /2)/2 and then averaging over the channel realiza- 
tions. The obtained asymptotic expression has a multiplying coefficient different from ( 2 "j^7 1 ), 
which is inexact but provides the same design criterion. 

Moreover, we notice that applying the Tarokh criterion [30] on the rank and determinant to 
the precoder alone does not lead to the whole BICM optimization. Quasi-optimal linear precoders 
will be designed to achieve full diversity and approach optimal coding gain in section [51 




(25) 
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3.5 Evaluation of the Frame Error Rate 

For ergodic channels, the frame error rate is easily computed via a union bound. Indeed, only 
error events with minimum Hamming distance impact the error rate for a high signal-to-noise 
ratio and the observed diversity is equal to n r dnmin- For block- fading channels, the frame error 
rate computation is much more tricky since each pairwise error probability is supposed to have 
the full-diversity order n c n t n r . Due to the random nature of each eigenvalue in (|25p . it is difficult 
to know the impact of each distance configuration on the final FER. 



However, one may assume that for a sufficiently high signal-to-noisc ratio, the FER satisfies 
the following expression: 

FER~^A w E (A>Alw) [P w (A,A)} (26) 

w 

where A w is weighting the impact of pairwise error probabilities with Hamming weight w in 
the global error probability and the expectation on (A, A) is allowed by the interleaver random 
structure. Let us define Q the global coding gain. Since each pairwise error probability is supposed 
to have full diversity, we write 



pER (27) 



n r n t n c J \ G 
and 

g-n r n t n c = £ A w E {AtA[w) [Q(A, A)~™°] (28) 
w 

where £7(A, A) is the coding gain associated with one pair of codewords. We note that optimizing 
independently all pairwise error probabilities, which will be done in the following, enhances the 
global performance. Moreover, we observe that the number of receive antennas does not affect the 
coding gain of a single pairwise error probability. The effect of the receive diversity appears in the 
expression of the global coding gain (j2"5|) . As n r n t n c grows, the smallest coding gains have more 
impact on the final performance. Asymptotically, if n r n t n c — > +00, only the nearest neighbors in 
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the Euclidean code have an influence on the FER, as for AWGN channels. 

We will see in section 15.11 that the best coding gain is achieved when all eigenvalues $P are 
equal. In this ideal configuration, the coding gain is shown to be the same as with the same coded 
modulation transmitted on a 1 x n c n t n r quasi-static SIMO channel. Simulating this latter case is 
less complex: the performance curve is semi-analytically computed using a reference curve on an 
AWGN channel. Alternatively, performance may be obtained by computing the Tangential Sphere 
Bound for spherical modulations [57]. In the following, ideal BICM will refer to the performance 
of the ideal configuration, which will be drawn on simulation results. This lower bound has the 
advantage to take the modulation and error correcting code into account and will be useful to 
evaluate the optimality of both the linear precoder and the channel interleaver. 

4 The Singleton bound with linear precoder 

Definition [T] ensures that any pair of codewords benefits from a full diversity order. In this section, 
we derive a condition on the existence of a practical interleaver that could achieve the conditions 
of Definition [TJ Let us first make the following assumption: 

Assumption 1 The detector perfectly converts the N t x N r correlated MIMO N c -block- fading 
channel SH/j with QAM input into a 1 x sn r SIMO n t n c / s -block- fading channel with BSK input, 
assuming that s is a divisor of ntn c . 

We will present in section [5] linear precoders that satisfy Assumption [T] Under this condition, 
the detector collects an amount of diversity equal to sn r . The full diversity ntn c n r is collected 
by the detector when s = ntn c , but unfortunately, the APP signal detection has an exponential 
complexity in s. On the other hand, the BICM channel decoder is also capable of collecting a large 
amount of diversity, but the latter is still limited by the Singleton bound [29 30J [34 . Hence, the 
lowest complexity solution that reaches full diversity is to draw advantage of the whole channel 
code diversity and recover the remaining diversity by linear precoding. The best way to choose 
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the spreading factor s is given by the Singleton bound described hereafter. 



The studied ST-BICM is a serial concatenation of a rate Rc binary convolutional code C, 
an interleaver of size NcLc bits, and a QAM mapper followed by the precoder as described in 
section [5J When S is the identity matrix, the ST-BICM diversity order is upper-bounded by [3D] : 



The maximal diversity given by the outage limit under a finite size QAM alphabet also achieves 
the above Singleton bound [5S]. With a vanishing coding rate, i.e., R c < l/(n c n t ), it is possible to 
attain the overall system diversity order n r n c nt produced by the receive antennas, the transmit 
antennas and the distinct channel states. Unfortunately, this is unacceptable due to the vanish- 
ing transmitted information rate. Precoding is one means to achieve maximum diversity with a 
non-vanishing coding rate. 

The integer Nb = n c rit/s is the best diversity multiplication factor to be collected by C. The 
length of a C codeword is LcNc binary elements. Let us group LcNc/Nb bits into one non-binary 
symbol creating a non-binary code C. Now, C is a length- Nb code built on an alphabet of size 
2L C N C /N b ^ rp^ e g m gi e t on bound on the minimum Hamming distance of the non-binary C becomes 
Dh < Nb — \NbRc \ + 1- Multiplying the previous inequality with the Nakagami law order sn r 
yields the maximum achievable diversity order after decoding [22] : 



Finally, since T is upper-bounded by the channel intrinsic diversity and the minimum Hamming 
distance dnmin of the binary code, we can write 



T < n r ([n c n t (l- R c )\ + 1) 



(29) 



T < sn r 



n c n t 



(l-i?c) + l 



(30) 



•s 



T < min 



( snr ^(l-7? c ) + lj; 



ntn c n r ; sn 




= T 



777 a X 



(31) 
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If dtfmin is not a limiting factor (we choose C accordingly), we can select the value of s that 
leads to a modified Singleton bound greater than or equal to ntn c n r . 

Proposition 1 Considering a BICM with a rate Rc binary error- correcting code on an n t x n r 
MIMO channel with n c distinct channel states per codeword, the spreading factor s of a linear 
precoder must be a divisor of n t n c and must satisfy s > Rcn c n t in order to achieve the full 
diversity n t n c n r for any pair of codewords. In this case, the ideal interleaving conditions can be 
achieved with an optimized interleaver. 

The smallest integer s op t satisfying the above proposition minimizes the detector's complexity. 
If Rc > 1/2, then s opt = n c nt which involves the highest complexity. If Rq < l/(n c n t ), linear 
precoding is not required. 

Tables [T] and [2] show the diversity order derived from the Singleton bound versus s and n*, for 
n c = 1 and n c — 2 respectively. The values in bold indicate full diversity configurations. For 
example, in Table [TJ for n t = 4, s — 2 is a better choice than s = 4 since it leads to an identical 
diversity order with a lower complexity. 

5 Linear precoder optimization 

Many studies have been published on space-time spreading matrices introducing some redundancy, 
well-known as space-time block codes. On one hand, some of them are decoded by a low-complexity 
ML decoder, but they sacrifice transmission data rate for the sake of high performance. Among 
them, the Alamouti scheme ]T| is the most famous, but is only optimal for a 2 x 1 MIMO chan- 
nel. The other designs allowing for low ML decoding complexity are based on an extension of the 
Alamouti principle (e.g., DSTTD [J2]) but also sacrifice the data rate. On the other hand, full rate 
space-time codes have recently been proposed [I] [TT] [T^] [13] [H] [IB J [35 . However, their optimiza- 
tion does not take into account their concatenation with an error correcting code. In this section, 
we describe a near-ideal solution for linear precoding in BICMs under iterative decoding process. 
Our strategy is to separate the coding step and the geometry properties in order to express some 
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criteria allowing the construction of a space-time spreading matrix for given channel parameters 
nt, n r and n c . The inclusion of rotations to enhance the BICM performance over single antenna 
channels has been proposed in 32J. Our solution uses this concept for designing a space-time code 
including a powerful error correcting code. 

When the channel is quasi-static or block-fading with parameter n c , the diversity is upper 
bounded by n c n t n r which may be more limiting than n r dumin (e.g., n t — 2, n r — 1, n c =l). 
We introduce a new design criterion of space-time spreading matrices that guarantees a diversity 
proportional to the spreading factor, within the upper-bound, and a maximal coding gain at the 
last iteration of an iterative joint detection and decoding. 

5.1 Coding gain under both ideal interleaving and precoding 

First we look for the best achievable coding gain for the fixed parameters nt, n ri n c , Rq and the 
appropriate way to choose the error correcting code, the binary mapping, the linear precoder and 
its parameters s and n s to achieve the ideal coding gain. 

We want to achieve full diversity under ML decoding or iterative joint detection and decoding, 
this induces that there are n c nt non-null eigenvalues (see (|2~i)) ): 



Furthermore, we want to maximize the G s ,n 3 (A, A) expression. Assuming that each row Sj is 
normalized to 1, we get 




(32) 



N c n s nt 



N c N t 



N c N t Kfc.i 




(33) 



k=l t=l u=l 



k=l 1 = 1 



k=l 1=1 i=l 
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Under this condition, the ideal coding gain is achieved when all eigenvalues are equal 

N c N t 2 

<l = EE^f V(M < U) (34) 
k'=i 1=1 1 c 

which leads to 

N -= N t ~2 w 2 

The exact pairwise error probability expression simplifies to the classical expression of the perfor- 
mance of a BPSK with distance ■_ 1 over a diversity channel with order n c ntn r [37j : 

(36) 

As stated in the introduction, in an ST-BICM, precoded modulation symbols quantify the 
Shannon sphere and best quantization is obtained by uniformly distribute them on the sphere. 
After transmission on a fading channel, vectors belong to an ellipsoid obtained by applying an 
homothety on the sphere. From (l34|) and (135 j) . we see that the ideal coding gain is obtained by 
equally distributing the Euclidean distance between two codewords among the ntn c channel states. 
Hence, the Euclidean distance varies as a n t n c n r Nakagami distribution, according to the square 
norm of the ellipsoid axes. Thus, an ideal ST-BICM aims at uniformly distributing the precoded 
modulation symbols, whatever the channel realization, i.e., whatever the homothety. The ideal 
coding gain is a fundamental limit which cannot be outperformed. It is useful to evaluate how 
optimal the practical design of a BICM is. We aim at finding the best design, corresponding to 
eigenvalues which are as close to each other as possible. The more different from each other the 
eigenvalues are, the lower the product in (f32|) and the coding gain are. From (|35|) . we see that 
the ideal coding gain is the same as for the same coded modulation transmitted on a 1 x n c iiin r 
single-input multiple-output (SIMO) channel, applying the appropriate E^/Nq normalization. 

Without linear precoding, the ideal coding gain is only achieved if all jk,i are equal. Remember 
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that each jk,l is a sum of Kk,i distances dfc,J,i- Thanks to the second point in Definition [IJ the 
Kk,i values are close to w/(ntn c ) and their variance decreases when w increases. Thus, with a 
powerful error correcting code having minimum Hamming distance much greater than n t n c and 
\D\, each jk,i value is almost equal to the average Y^,k=i Ei=a <%k,l,il '(N t N c ) of dk,i,i values 
and quasi-ideal coding gain is observed. 

If the error correcting code is not powerful enough to achieve the ideal coding gain, i.e., the 
jk,i values are very different, the linear precoder provides an additional coding gain by averaging 
the jk.i values, as we will see in the following. First, we derive the optimal coding gain which can 
be achieved using an ideal linear precoder for a given binary labeling and error correcting code. 
Variables ^k.i for different k values correspond to independent channel realizations Hfe which are 
not linked by the linear precoder. Thus, random variables n™=i TluLi $fc « are independent for 
distinct values of k. The optimal coding gain with linear precoding is 

^,o Pt (A,A) = n^E^J (37) 

Equation (|37| means that an optimal linear precoder is capable of making eigenvalues equal for 
a same k. However, for different values of k, eigenvalues are different, which induces a 
coding gain loss. When the mapping and error correcting code are given and the interleaving is 
ideal, the choice of linear precoding parameters impacts on optimal coding gain. Let us consider 
codewords that are equidistant from the transmitted codeword, i.e., a set of distance configurations 
corresponding to a same value of , 7^ ; . The variance of J^jSi Ik i/( n t n s) over this set decreases 
when n s increases, as the number of distances building each 7^/ is higher. The lower the variance 
of eigenvalues, the higher the coding gain. Thus, Gs,n s ,opt(A, A) is an increasing function of n s and, 
for a given s, we should choose n s — min(s,n c ). The optimal coding gain G s ,min(s,n c ),opt{A, A) is 
an increasing function of s. If n a — n c , which implies s — n t n c , the ideal coding gain is achieved 
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by the optimal precoder. Finally, we can surround the coding gain at full diversity as follows: 

Vs g ideal {A,A) >g s 

,min(s ,n c ) ,opt 

(A, A) > g S) i, op t(A,A) > G hhopt (A,A) > g bf (A,A) (38) 

If, for any pairwise error probability, g^f^A, A) ~ gideal(A, A), the linear precoder optimization 
is useless from a coding gain point-of-view. However, obtaining near-ideal coding gain without 
precoding requires an optimization of the error correcting code and mapping for any pairwise error 
probability, which is intractable for non-trivial modulations and codes. Furthermore, the first ob- 
jective of linear precoding is the diversity control, which has a high influence on the performance 
even at medium FER (10~ 2 ~ 10 -3 ), especially for low diversity orders. Therefore, precoding is 
often useful in the BICM structure. 

After the impact of the linear precoding for a given pairwise error probability, let us consider 
the behavior of the global performance under linear precoding. As stated in section T3.51 if n r ntn c 
grows, the pairs of codewords providing the smallest coding gains have more impact on the final 
performance. Since the linear precoder provides a more substantial gain for the low Hamming 
weight configurations, the coding gain of the linear precoder will be magnified as the diversity 
grows. 

Example of ideal coding gain: In order to illustrate the role of the linear precoding in the 
coding gain optimization, we consider a 2 x 1 quasi-static MIMO channel and a pairwise error 
probability between two codewords separated by a Hamming distance of w bits. Fig. [3] represents 
the distribution of the two 71 and 72 values over the two transmit antennas without linear precod- 
ing. Bits transmitted on antennas 1 and 2 are transmitted on the sets of time periods T\ and T2, 
respectively. Thanks to ideal interleaving, T1QT2 = 0. This illustrates the factorization of the 
distances into the 7 values. The instantaneous coding gain is equal to ^/7i7l • Now let us consider 
a specific linear precoder, which spreads the values 71 and 72 as presented in Fig. 0]over two time 
periods and two transmit antennas dividing the squared distance in two equal parts 7i/2 and 7I/2 
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respectively. The average value (jf + 7 2 )/2 is transmitted on each antenna, the coding gain is 
optimal and equal to (7^ +7D/2. For example, consider a BPSK modulation and a pairwise error 
probability with Hamming weight 3. With optimal linear precoding, the ideal interleaving provides 
for example 7 2 = 2 x 2 2 and 7! = 1 x 2 2 . With optimal linear precoding, we have a distance 
(2 x 2 2 + 1 x 2 2 )/2 associated with each antenna. The ratio between the two averaged coding gains 
is equal to \/9/8, i.e., we expect a gain of 0.26 dB when using linear precoding. With w = 5 and 
to = 11, the coding gain becomes 10 log 10 ( y/24/25) ~ 0.09 dB and 10 log 10 (V 120/121) ~ 0.02 dB, 
respectively. The higher the Hamming weight involved in the pairwise error probability is, the less 
the coding gain provided by linear precoding is. I 

We see on Table [3] the best gain to be provided by linear precoding for a quasi-static channel 
with BPSK input with respect to a full diversity unprecoded scheme. These gains are particularly 
low because the error correcting code aims at recovering a large amount of coding gain. This 
illustrates that BICMs are very efficient transmission schemes. As a remark, if a modulation with 
higher spectral efficiency is used with Gray mapping, the nearest neighbor in the Euclidean code 
has the same distance configuration as if a BPSK modulation was used. Moreover, for high di- 
versity orders, the global error rate for high E^/Nq will be dominated by the neighbors and the 
gain provided by linear precoding will be very close to the ones shown in Table [3j However, if the 
diversity is low, the gains provided by linear precoding may be much more important. Assume 
that a 16-QAM modulation with Gray mapping is transmitted on a n t = 2 quasi- static channel. 
For instance, if w = 5, there exists a neighbor with distance configuration (3A, 3A, 3A, A, A) (e.g., 
see [53]), and 7 2 = 9 A 2 + 9 A 2 + 9A 2 , 7I = A 2 + A 2 . The gain to be provided by linear precoding 
is equal to 101og 10 (29/2/V54) = 2.95 dB. As already stated, the final coding gain is equal to a 
weighted sum of all the coding gains, where the weighting coefficients cannot be easily computed 
in the case of low diversity orders. 

Even if linear precoding does not always provide a substantial coding gain, its prior aim is the 
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diversity order control. Thus, we will focus on the design of linear precoders aiming at reaching 
full diversity and maximizing the coding gain for any set of parameter (n t , s,n s ). 

5.2 A new class of linear precoders 

Under linear precoding, the optimal coding gain is achieved if all $1 variables are equal for a 
same k. Let us first consider the eigenvalues associated with the independent realizations in the 
spreading matrix, indexed by t. First, two matrices M^' 1 ' and Mj* 2 ', as introduced in (|19p . should 
have the same eigenvalues, which is satisfied if V(fi, t 2 ), M|* l] = R^'^M^R* 1 -* 2 , where R* 1 '* 2 
is a unitary matrix, for example a rotation. Hence, V(ti, £2), S'| = S'| R' 1 '* 2 . The precoding 
sub-part Sj , with spreading factor s' — s/n s , experiences a quasi-static channel. We assume 
that s' is an integer, divisor of nt. It is sufficient to design the first sub-part of the precoder matrix 
rows for a quasi-static channel and rotate it to compute the other sub-parts. Furthermore, any 
choice of R' 1 '* 2 leads to the same performance because the eigenvalues remain unchanged. The 
condition simplifies to ||sf = ||S'f 2] ||. 

Let us now optimize for a given index t the equivalent precoder over the quasi-static channel 
diag ^h[*', . . . ,H^J, in which Hj*' is repeated s' times. If all the eigenvalues of M^M^* are 
equal, and M^* are weighted unitary matrices and 

N t a' 

M W M W. = M M. M W = £ 7 2 1 £ s p* s p] (39) 

1=1 i=l 

Matrix ■'*Sj'' has rank one and matrix Si s| has maximum rank s' . If s' < n*, 

it can be shown that it is impossible to get all eigenvalues equal to 7fe ;/ n * n s as required to 

achieve the optimal coding gain. However, in order to insure that M^M^* has a rank n t and 
that the eigenvalues are as equal as possible, we group ss' values jk,i together and associate them 
with one of the n t /s' groups of s' eigenvalues: we denote s{*" l "' J ' the j-th sub-part of size s' of 
s|'" 1 ' and {l2,h} the index of the (h — l)ss' + h-th row of S, where h € [l,n t /s'],Zi e [1, ss']. 
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Let us assume that sK, has only one non-null sub-part in position I2, i.e., 



Vj^fe S{« = [0,...,0] (40) 

Considering such a structure is equivalent to considering separate precoding on rit/s' distinct 
groups of s' transmit antennas. We have 

N t nt/s' ss' s' 

V^ 2 s' w *s' [t] - V Vs 1 ' 11 ' 1 ' s [t][iI an 

2^7k,i*i ° 1 - 2^ Z^ 7 fc.{Wi}2^ & {Wi} & {Wi} ( - 4i ' ) 

1=1 / 2 =1 ii=l i=l 

nt/s' ss' Is' \ 

--X^\^^2 ^ [ ^ S [*][i][i2]* S [i][i][l 2 ] U9] 

i 3 =l ( 1= 1 \i=l / 

where D; 2 (v4) is a block diagonal matrix with only one non-null block A in position I2. We choose 
^{li'li}^ P r °P or ti° na l to the i-th row of a s' x s' unitary matrix, such as ||S^ l j^ || 2 = 1/s: 



1=1 h=i h=i v y 



(43) 



1 

- £ diag (7fc,{i,i l} W • -i-ylint/s'M 1 *') ( 44 ) 



s 

h=i 



which leads to 

1 ' s ~ ' 

h < th/s', u < s', <\ /2 _ 1)ss , +u = -J2 li{hM ( 45 ) 

h=i 

The random variables 7 2 ^ j i are independent and identically distributed for different values of 
l\ and I2, the coding gain is 

N c n t ,s> ( ss' , \ S ' /(N ^ ] 

uA,A)=nn (46) 
k=i i 2 =i \h=i s j 

For any value of n s , the gain expressed in (|46jl is a geometric mean of order ntn c /s. For a 
given realization {di,...,d w }, a given s and for any n s , J2k=iJ2?2=i ^h=i^k,{i 2 ,h} and tllus 
U=r7 fc 2 

{h.h} are constairt ; ensuring the same coding gain. However, such a precoder does not 
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achieve the optimal coding gain for any value of s' . The summation is made over ss' different 
values whereas the optimal coding gain in (|37[) necessitates a summation over snt values. Only if 
ss' is high enough, the obtained coding gain is almost optimal. If s' — n t , the complete spatial 
transmit diversity is collected by the detector and the optimal coding gain is achieved. 

Proposition 2 Dispersive Nucleo Algebraic (DNA ) Precoder Let S be the N t xN t precoding 
matrix of a BICM over an t xn r MIMO n c -block- fading channel. Assume that S precodes a channel 
block diagonal matrix with s blocks and n s channel realizations. We denote s the spreading factor, 
Nt — sn t and s' — s/n s . Let s{ be the t-th sub-part of size N t /n s of the l-th row ofS. Let sj*" 1 ' 
be the i-th sub-part of size n t of sf\ Let ^ be the j-th sub-part of size s' o/sj*" 1 '. The sub- 
part sf^^ is called nucleotide. The linear precoder guarantees full diversity and quasi-optimal 
coding gain at the decoder output under maximum likelihood decoding of the BICM if it satisfies 
the two conditions of null nucleotides and orthogonal nucleotides for all t £ £ [l,s'],l\ £ 

[l,ss'],l 2 € [l,n t /s'] and{l 2 ,h} = (h - l)ss' + h: 
> 

7^ h,j £ [l,?^t/s'], Srj] j^j = Oixs' Null Nucleotide condition 

< 

W ^ i, i' £ [1, a'], S n^hf S nfh} ] * = ~ Orthogonal Nucleotide condition 

(47) 

where 0(0) = 1 and d(x ^ 0) = 0. 

Let us take for example n t = 4, n s = 1 and s = 2. A DNA matrix would have the following 
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structure: 



DNA(n t =4,n s = l,s = 2) = 



q[1][1][1] 

q[1][1][1] 
3 {1,2} 

5 {1:3} 

q[1][1][1] 
S {M} 










q[l][l][2] 

5 {2,1} 

c[l][l][2] 
{2,2} 

o[l][l][2] 
{2,3} 

c[l][l][2] 
=>{2,4} 



c[l][2][l] 
& {1,1} 

q[l][2][l] 

3 {1,2} 

c[l][2][l] 
3 {1,3} 

tj[l][2][l] 










cW[2][2] 
3 {2,1} 

q[i][2][2] 

{2,2} 

c[l][2][2] 
{2,3} 

o[l][2][2] 
S {2,4} 



(48) 



Now, let us consider a linear precoder matrix S that satisfies Proposition^ Wc build a ss' x N r 
matrix from the rows of corresponding to the i-ih group of s' transmit antennas. Elements 
of are defined as follows: 



Vi = . . . n t /s' - 1, Vj = . . . s' - 1, Vu = . . . s - 1, V« = . . . N r - 1, 

H fc l} (i + us ',v) = Hfc(j + un t + is',v) 



(49) 



Likewise, is the ss' x ss' matrix obtained from the i-th block of ss' rows of S and every 
rit/s'-th block of s' columns beginning with the i-th. block. We easily show that 



s«h1 1} 



S {2} H f } 



S{n t /s'}-fi{ n t/s'} 



(50) 



which means that the matrix S independently precodes the nt/s' groups of transmit antennas. 

Thus, the optimization may be split into nt/s' independent optimizations of linear precoders 
for s' x n r MIMO rt s -block-fading channels with linear spreading factor s. As s = s'n s , full space- 
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time spreading of the s' x n r block- fading channel is performed, i.e., the maximum diversity order 
sn r is collected by the detector. 

From (fT"5|) and (|46|) . we notice that, at the decoder input and under ideal interleaving condition, 
the linear precoder at the transmitter end and the detector at the receiver end allow the conversion 
of the n t x n r MIMO channel with n c independent blocks into a 1 x sn r SIMO channel with n c n t /s 
independent blocks with BSK input. The independence of the blocks is provided by the structure 
of the linear precoding matrix: 

1. The null nucleotides dispatch the transmitted symbols on n t / s' different blocks of s' antennas. 

2. The orthogonal nucleotides provide full diversity and a coding gain increasing with the 
spreading factor. 

For instance, if a rate 1/2 BICM is transmitted on a quasi-static 4x2 MIMO channel, linear 
precoding with s = 2 is required to achieve full diversity: a full-rate space-time block code with 
spreading factor s = s 1 = 2 may independently be applied on 2 separate groups of 2 transmit an- 
tennas. Good 2x2 space-time block codes are for instance the TAST [12] and the Golden code [4]. 

Assume that n c = I, n t — n r = 2 and s = s' = 2. The Golden code is the best space-time 
code for uncoded 2x2 quasi-static MIMO channels. However, it does not satisfy the equal norm 
property of orthogonal nucleotides in Proposition^ Indeed, one row of the Golden linear precoder 
contains two non-null coefficients of square norm ot\ = 0.277 and ai — 0.723, respectively. Thus 
(US]), which assumes equality between the eigenvalues of M^'mJ^*, does not hold. It can be shown 
that (let 7 ? = 7l 2 

e?Go/den(A, A) = yj (a X (7? + l\) + <*2 ill + 7s)) ("l (t| + ll) + »2 hi + 7f )) (51) 

where — 7^ i y. As d,Hmin increases, (7^ + 7 |)/(7| + 73) tends to 1 for any pairwise error 
probability and GGoiden(A, A) — > (?2,i,opt(A, A): The error correcting code limits the coding loss 
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due to the non-equal norm of the sub-parts of the Golden code. As a remark, if 7§ +73 = 0, which 
is the worst case, the coding loss is 10 \og w (yJ a.ia.2 / 4) = 0.5 dB. 

With DNA precoder and ideal interleaving, Assumption[T]is satisfied and the modified Singleton 
bound on the diversity order can apply. All results from the field of error correction coding over 
block-fading channels directly apply without any modification to the new 1 x sn r SIMO channel 
with n t n c /s independent blocks. 

5.3 The genie method design criterion for full spreading linear pre- 
coders (s f = n t ) 

A linear precoding design criterion based on the genie performance optimization at the detector 
output has been proposed in [8]. When a genie gives a perfect information feedback on the mnt 
coded bits required in the APP detector computation, the performance is computed by averaging 
all the pairwise error probabilities obtained when changing only one bit out of mn t . Denote d the 
distance of the BSK. Assume that the BSK is transmitted on antenna /, the asymptotic expression 
of the error probability with genie is 

where A = {S v } is the set of square-roots of distinct non-null eigenvalues of d 2 S'|*'*S'[*' for all t, 
X v their frequency and Ns their number. In the best case, there are s non-null eigenvalues and 
the coding gain is maximized if they are equal. First, a sufficient condition to have an equality 
between the eigenvalues of S'f l] *sf l] and S'f al *sf al is ||Sf l] || 2 = ||S'|* 21 1| 2 . Then, all eigenvalues 
of S'J''*S'{'' are equal if S'J*' is a unitary matrix, which leads to the following proposition: 

Proposition 3 A linear precoder achieving a diversity order sn r with maximum coding gain at 
the detector output must satisfy the following conditions under perfect iterative APP decoding of 
the space-time BIC'M: 

1. The n s subparts of the rows in the sn t x sn t precoding matrix have the same Euclidean norm 
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2. In each of the n s subparts, the s subparts (nucleotides) are orthogonal and have the same 
Euclidean norm 

Proposition [3l which is more intuitive, is equivalent to Proposition [2] only if s' — n t , i.e., in 
case of full spreading. 

5.4 Non-full spreading quasi-optimal linear precoder: DNA cyclotomics 

If s' nt, Proposition [3] is not optimal in terms of maximum likelihood performance. However, 
we can split the optimization of a Nt x Nt linear precoder with spreading factor s into nt/ 1 s' 
optimizations of full spreading N' t x N' t linear precoders with N' t = s's. The optimization of S is 
now done in two steps: 

1. Apply the genie method to design a full spreading N' t x N' t linear precoder for a s' x n r 
MIMO channel with n s blocks, satisfying Proposition [3 

2. Place the non-null sub-parts in S as described in Proposition [5] 

Cyclotomic rotations [3] provide good performance on ergodic Rayleigh SISO channels and have 
the great advantage to exist for any number of complex dimensions. Moreover, any coefficient has 
a unity norm which implies that the norm condition of Proposition [3] is naturaly satisfied. We 
modified the cyclotomic matrices to satisfy the orthogonality condition in the case of full spreading 
s = n t n s . The coefficients of S are equal to 

S l,v+(i-l)n t +(t-l)n* = ,_„. 

(53) 

vk [c - 1) (*=**j + (t ' 1)Tt?+( ;: 1) " t+ "" 1 ) (pfe + ^)] ) 

We denote S(n t ,n s ,n t n s ) the modified cyclotomic rotation designed for a n t x n r MIMO block- 
fading channel, assuming that the precoder experiences n s channel realizations. The last parameter 
in S(n t ,n s ,n t n s ) denotes the spreading factor. 
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To satisfy Proposition which gives the design criterion for non-full spreading quasi-optimal 
linear precoders, we follow the two steps described above. Following (|53p . we first construct 
S(s',n s ,s) designed for full spreading of a s' x n r MIMO block-fading channel with n s channel 
states in each precoded matrix. Then, we place n t /s' times each subpart of S(s' ,n s , s) in the 
precoding matrix in order to satisfy Proposition^ and construct the quasi-optimal linear precoder 
S{nt, n s , s) for any set of parameters n t , n s and s. Its coefficients are equal to 

VZ 2 e [l,n t /s'],VZi e [l,ss'],Vie [l.nJ.Vie [M'],Vue [l,s'], 

S(l2-l)s's+h,v+(l2-l)nt/s'+(i-l)nt+(t-l)s'nt = 

(54) 

(h - 1) + ^±^g±^ ) + a- 1) + ^)]) 

and elsewhere. 

5.5 Performance of the quasi-optimal precoder with iterative receiver 

We have presented quasi-optimal linear precoders providing good coding gain and full diversity 
ML performance under ideal interleaving. However, the ML decoder of the global Euclidean code 
does not exist and we process iterative joint detection and decoding. Proposition [2] is satisfied by 
an infinity of matrices, all providing the same ML performance. Let us consider the performance 
behavior after the first iteration. As no a priori information is available at the detector, errors 
before decoding are numerous and not necessarily transmitted on different precoding time periods. 
Let us consider one precoding time period and assume that we observe two erroneous bits. If the 
bits are transmitted on the same modulation symbol, the Euclidean distance dk changes but this 
does not affect the linear precoder optimization. However, if the two bits are placed onto two 
different rows of S, the average performance might be modified and interference inside a block 
and between blocks should be considered. An optimization of the precoder following the Tarokh 
criterion should be done, under the conditions presented in Proposition^ Simulation results show 
that the modified cyclotomic rotation has good uncoded ML performance, close to algebraic full 
rate space-time block codes. Thus, we expect good performance at the first iteration of a joint 



±= exp ( 2jn 
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detection and decoding process, which is desirable to reduce the number of iterations needed to 
achieve the near ML performance and to provide good performance with non-iterative receivers. 
The optimization of the first iteration is not addressed in this paper, but first answers are given 
in (3]]- 

6 Practical interleaver design for convolutional codes 

The maximum diversity to be gathered is limited by the characteristics of the channel, the linear 
precoding spreading factor and the minimum Hamming distance of the binary code, all summa- 
rized in pip . Assume that the linear precoder spreading factor s is chosen such that diversity order 
is maximized, T max — n c n t n r . Thus, there exists an interleaver that allows ML performance with 
full diversity. We present a new BICM interleaver design which satisfies Definition [T] and leads to 
the concept of full diversity BICM since the system exhibits a predetermined diversity whatever 
the parameters of the considered block-fading channel. 

We first build an interleaver that enables to achieve maximum diversity on an n t x n r quasi- 
static MIMO channel (n c = 1) with BPSK input. Then, we generalize the interleaver construction 
to apply it to higher spectral efficiency modulations, linear precoding and finally block-fading 
channels (n c > 1). 

6.1 Interleaver design for quasi-static MIMO channels with BPSK input 

On quasi-static channels, a codeword undergoes only one channel realization. Let us consider an 
error event in the code trellis for which w coded bits differ from the transmitted codeword. As all 
error events are supposed to have a non-zero probability, the interleaver should be designed for 
any of them. Let us ensure the equi-distribution property that LiNq successive coded bits, LiNq 
being the length of an error path with Lj branches, are transmitted by all the n t transmit antennas 
in the same proportion. To optimize performance, we must also ensure the non-interference of 
erroneous bits within the same time period. In the ML sense, two interfering erroneous bits may 
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either degrade the diversity or the coding gain. When considering the graph representation of our 
system model in Fig. [TJ a time period corresponds to one channel node. Probabilistic messages on 
bits should be independent. Practically, bits inside a channel node should be connected to distant 
positions in the code trellis. These conditions lead to a design criterion for quasi-static channels, 
well known in the algebraic space-time coding theory as the rank criterion [40j and applied here 
to the BICM interleave!'. 

To design an interleaver with size LqNq ensuring that consecutive bits are mapped on different 
symbol time periods over all the transmit antennas, we demultiplex the LcNq coded bits into rit 
vectors of length LqNq l n t- Each of these nt sub- frames is separately interleaved and transmitted 
on a predetermined transmit antenna. However, the demultiplexing step is not simply processed 
via the periodical selection of every nt bits. Indeed, some error patterns of convolutional codes have 
periodic structure. This may result in non-equally distributed erroneous bits on the nt transmit 
antennas and bad coding gain for these error patterns |24j . In order to break periodic structures, 
we apply the following demultiplexing 

< i < nt, 0<j< L c N c /n t , Vi(j) = V ((*+ j) mod n t +jn t ) (55) 

where V is the codeword to be demultiplexed, V, is the i-th demultiplexed frame. This ensures 
the uniform distribution of erroneous bits over n t transmit antennas all along the transmitted 
frame. Let us now limit the interference of erroneous bits during the same time period. We 
assume that only simple error events occur. If the same interleaver is used for all sub-frames, n t 
consecutive bits are in the same position of interleaved sub- frames and we can limit the interference 
by sliding each sub-frame by one bit position and transmit all frames serially on their associated 
antennas. Yet, this does not guarantee that LjNq successive bits are transmitted over distinct 
time periods. To satisfy this strong condition, we use a particular S-random interleaver [16 with 
a sliding input separation which guarantees that any Lj successive bits in the interleaved frames 
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are not transmitted during the same block of n t time periods. If we consider that bit position i is 
placed at position IL s (i) by the interleaver II S , we should have 



< j < L c Nc/n t -Li, < i < L r , 



n s (j) 



U s (j + i) 



(56) 



Each of the nt sub-frames V, is interleaved into V; 



0<i<n u 0<j<L c N c /n t , V, (II s (j)) = V< (j) 



(57) 



Then, a new sub-frame is built from as follows: 

< i < n t , 0<ji< L c N c /n 2 t , < j 2 < n t , V- {(i + j 2 ) mod n t + jmt) = V 4 (j 2 + jwt) 

(58) 

The above construction keeps blocks of n t bits of Vj in positions corresponding to the same nt 
time periods in V^, but with a cyclic shift of i positions in a block of size nt- 

6.2 Basic interleaver construction 

Let us generalize the interleaver construction to design a basic interleaver Inz,Si,Li for Nj channel 
inputs, a frame size Si bits and a separation L/. We described 1 nt .L c vic.Li i n the previous section. 
For more general system configurations, the basic interleaver 2"jV/,S/,L/ will be used in the sequel. 



In Fig. we present the basic interleaver for Ni = 4 channel inputs. Codeword bits are 
distinguished by four different patterns, each pattern corresponding to a specific channel input. In 
step 1, the codeword is demultiplexed into iVj sub-frames V^, i = 0, . . . , Ni — 1, of length Si/Ni 
each, as presented in the previous section. In step 2, each vector of size Si/Ni is interleaved 
by the S-random-like interleaver into a vector V^. In step 3, we build a Ni x Si/Ni matrix as 
the concatenation of Si/Nf matrices of size Ni x Ni. The latter are circulant matrices where the 
first row contains the Ni first values of Vq, and the second row contains the first 2Vj values of Vi. 
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Rows 3 and 4 are built from V2 and V3 similarly. 

Finally, the Ni x Sj/Nj matrix is transmitted over the space-time channel by distributing its 
rows on channel inputs and its columns on time periods. 



This interleaving guarantees that (Lj — l)iVj + 1 consecutive codeword bits are not transmitted 
during the same time period. The value of Lj of the S-random-likc interleaver should be chosen 
as large as possible in order to take into account long error events. An upper bound for L/ can be 
found based on the interleaver separation similar to classical S-random [16j . The interleaver has a 
sliding input separation equal to Li and an output block separation equal to Ni within a sub- frame. 
Hence, drawing a a simple two-level tree representation would lead to (2L/ — 1) X Nj < S[/N[, 
rewritten as 



6.3 Interleaver design for quasi-static MIMO channels with M-ary input 

In section I5TT1 we have presented an interleaver for MIMO quasi-static channels and BPSK modu- 
lation. For a modulation with higher spectral efficiency, erroneous bits in an error path should be 
dispatched on different time periods and equally transmitted over all the transmit antennas and 
bit positions. Repartition on different bit positions is required as different bits of a modulation 
scheme are not equally protected. These conditions are satisfied by the T mntt L c N Ci Li interleaver. 

Increasing the diversity by transmitting erroneous bits on all antennas is more important than 
increasing the coding gain by transmitting them on all modulation bits. The nt first sub-frames 
should be transmitted on the n t transmit antennas and on the first mapping bit. The second block 
of n t sub-frames should be transmitted on the second mapping bit, and so on. 

6.4 Application to linear precoding 

When a linear precoder is used to recover a part of the transmit diversity, the new channel matrix 
SH has sn t x sn t rows and columns. Linear precoders have been optimized in section [5] when 




(59) 
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at most one erroneous bit is observed on each precoding time period. We have shown that the 
precoded channel output is divided into independent blocks, we modify the order of the rows as 
follows (s' = s/n s and N' t = s's) 

VZ 2 £ [l,nt/s'],Vli e [l,ss'],Vt € [l,n s ],Vi E [l,s'],Vv € [l,s'], 

S(lx-l)nt/s'+l 2 ,v+(l2-l)nt/s'+(i-l)nt+(t-l)s'rit = 

(60) 

(h - i) (H^y + ^-^-^' 2 ) + (i- i) + *?)]) 

and elsewhere. 

Now, the nt/s' consecutive rows of S lead to independent row vectors SjHfc that look like a true 
multiple antenna channel. In this case, the interleaver T S mn t ,L c Nc .Li is designed for diversity and 
gain exploitation. As presented in the previous subsection, the sn t first rows of the last interleaver 
matrix will be transmitted on the first mapping bit, and so on. 

6.5 Interleaver design for block-fading MIMO channels 

For block-fading channels, n c different channel realizations occur during the codeword. In order 
to take advantage of the transmission and time diversity given by the linear precoding and the 
n c different realizations of a block-fading MIMO channel, the interleaver of a BICM should place 
consecutive bits on different precoding time periods and equally distribute them among all linear 
precoding rows and all n c channel realizations. 

We extract n c sub-frames from the codeword, each sub-frame will be transmitted on one of the 
n c blocks, and only experience one channel realization. We interleave each sub-frame with the 
interleaver optimized for MIMO quasi-static channel to exploit the linear precoding diversity. 
The demultiplexing into n c sub-frames is done in the same manner as for the channel inputs in 
step I of Fig. [51 

< i nc <n c ,0<j < L e N e /(n c n t ),V? c (j)=V((i nc + j) mod n c +jn c ) (61) 
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This demultiplexing/interleaving is sufficient to exploit the time diversity. Indeed, there is no in- 
terference between the symbols experiencing the different channel realizations contrary to symbols 
transmitted on different linear precoding rows and bit positions. 

6.6 Application to turbo-codes 

The BICM precoder and interleaver have been designed to provide full diversity and optimal 
coding gain for any pairwise error probability. However, the error rate is given by the probability 
to leave the Voronoi region. With convolutional codes, the number of neighbors increases with the 
frame length whereas the minimum Hamming distance djimin remains constant. Thus, the frame 
error rate increases with frame length. To obtain the opposite behavior, the Euclidean distance 
must increase with frame length and provide a performance gain higher than the performance 
degradation due to the increased number of neighbors. It has been shown in [25j[9] that turbo- like 
codes can fulfill such a condition over block- fading channels. As proposed in [23], we modify the 
classical parallel turbo code with two encoders RSC1 and RSC2 and an interleaver H t by adding 
a de-interleaving H^ 1 of coded bits at the output of RSC2. Thanks to this de- interleaving, error 
events are localized and the optimized channel interleaver can be applied. 

7 Simulation results 

In this section, we evaluate the performance of actual iterative joint detection and decoding of 
the ST-BICM. The APP detector is performed by exhaustive marginalization. The set of 2 mNt 
noiseless received precoded symbols zSH is computed once per channel block realization since 
the channel matrix SH is constant during the block. This results in a complexity reduction for 
the marginalization, which now requires around LqNc / (msnt)2 mNt operations per iteration if 
s <C LqNq. For large values of mN t , the complexity of the exhaustive search becomes prohibitive. 
In order to cope with complexity issues, quasi-optimal or sub-optimal MIMO detectors may also 
be used, e.g., a SISO list sphere decoder [2SJ [35] [3] , a SISO-MMSE detector [17] [UJ or a detector 
using sequential Monte Carlo method [T5] , 
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Let us consider a 2 x 1 quasi-static (n c = 1) MIMO channel and QPSK modulation. We use 
(7, 5) 8 NRNSC or (3, 2) 8 NRNSC codes with rate 1/2 and a blocklength of 1024 coded bits. From 
the Singleton bound we know that full diversity can be achieved without linear precoding. We 
compare on Fig. [5] the performance obtained with a classical PR intcrlcavcr and the performance 
obtained with the optimized interleaver described in section[U Full diversity order is only achieved 
with the optimized intcrlcavcr, for which the performance slope is equal to the one of the outage 
probability. The optimized interleaver provides performance improvement without any increase of 
complexity neither at the transmitter nor at the receiver. In most cases, the PR interleaver only 
provides a diversity n r , i.e., it does not allow any transmit diversity order recovery. The (7, 5)s 
NRNSC code achieves a higher coding gain than the (3, 2)g NRNSC code. It achieves performance 
within only 2.5 dB from the outage capacity with Gaussian input and within 1.5 dB from the 
outage capacity with QPSK input. The performance lower bound corresponding to ideally pre- 
coded BICM is also drawn. It is obtained from the performance of the same coded modulation 
transmitted on a 1 x n c ntn r SIMO channel, as explained in section 15.11 There is a 1 dB gap 
between ideal and actual performances with the (3, 2)g NRNSC code and a 0.75 dB gap with the 
more powerful (7,5)8 NRNSC code. This confirms the analytical result of section I5TT1 obtained 
for ML performance: The higher the Hamming weight is, the closer to the ideal performance 
the actual iterative receiver can perform. However, a better code does not always provide better 
frame error rate. Indeed, we have seen that, when w > ntn c , the full diversity of the considered 
pairwise error probability can be achieved with an ideal interleaver. The remaining w — n t n c BSK 
distances are uniformly distributed among all the channel states. A better error correcting code 
with greater Hamming weights w' does not enhance the diversity but the coding gain per pairwise 
error probability. However, the degradation induced by the increased number of neighbors may 
be higher than the improvement brought by increased coding gains. How to handle this trade-off 
is left for further study. 
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In Fig. we show the performance of a rate-1/2 (7, 5) 8 NRNSC code over a 2 x 2 MIMO 
block-fading channel with n c = 2 and QPSK input. The frame length is 256 coded bits. With a 
PR interleaver, a diversity order n r = 2 is achieved, as transmit diversity is not collected. Even 
with the optimized interleaver, full diversity is not obtained at the last iteration. Indeed, the 
Singleton bound is equal to 6 without linear precoding. Two different linear prccoders, the Golden 
code and the DNA code, both with s = 2, arc used to achieve the full diversity order 8. The slope 
difference between diversity orders 6 and 8 is not significant. However, linear precoding provides 
an additional coding gain which allows to perform within 2 dB from the outage capacity with 
Gaussian input using a four-state convolutional code and a small frame length. The Golden code 
does not satisfy the equal norm condition, which induces a slight loss in coding gain. Neverthe- 
less, this loss is fully compensated by the averaging of the dkj^ into equal values provided by 
the error correcting code as explained in 15.11 For a higher frame length, the performance with 
convolutional codes is degraded. Therefore, we will also investigate performance with turbo-codes. 

In Fig. [8j we compare two strategies for achieving full diversity with BICM: linear precoding 
and constellation expansion [26) . Constellation expansion consists in increasing m while decreas- 
ing the coding rate, in order to achieve the full diversity without precoding and with the same 
spectral efficiency. A MIMO 2x2 channel with n c — 2 is considered. The frame length is 1024 
coded bits. Using QPSK modulation and rate-1/2 coding, full diversity is not achieved. Using 
a precoded QPSK with s = 2 and a 16-state rate-1/2 (23, 35)s NRNSC code having minimal 
Hamming distance 7, we get the same spectral efficiency, 2 bits per channel use, and the Singleton 
bound is equal to 8, the full diversity order. We compare this full-diversity scheme using linear 
precoding with a scheme using constellation expansion from QPSK to 16-QAM with a 64-state 
rate-1/4 NRCSC code having generator polynomials (135, 135, 147, 163)s and minimal Hamming 
distance 20. With the latter scheme, we get the same spectral efficiency and the Singleton bound is 
also equal to 8. The linear precoder provides a greater diversity order at the first iteration. At the 
last iteration, both schemes have same diversity and the precoded scheme slightly outperforms the 
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scheme with constellation expansion. Since the detector complexity is around LcNc/ (msn t )2 mNt 
operations per iteration if s <C LcNq, the detection of the precoded system is as complex as 
the detection of the one with constellation expansion. However, channel decoding of the 64-state 
(135, 135, 147, 163)s NRNSC code is more complex than the decoding of the 16-state (23,35)s 
NRNSC code. Thus, to get a same performance, it is less complex to use linear precoding than 
to use constellation expansion. When choosing a 64-state NRNSC (133, 171)s code with rate 1/2 
and minimal Hamming distance 10, the coding gain is increased by almost 1 dB. 

In order to increase the frame length without degrading performance, we now consider turbo- 
codes. Fig. [5] illustrates the performance of a (7, 5)g RSC turbo-code over a 1 x 1 channel with 
n c = 4, 16-QAM input and either a PR or an optimized interleaver. Two different frame lengths 
(256 and 2048 coded bits) are tested. With the PR interleaver and without precoding, the full 
diversity order 4 is not achieved. If the optimized interleaver is used, the full diversity order is 
not achieved neither, but the smaller slope is not visible down to a FER equal to 10 -3 . A similar 
behavior is obtained with PR interleaver and precoding s — 2. Finally, the DNA precoded mod- 
ulation with optimized interleaver achieves full diversity performance within less than 2 dB from 
the outage capacity with Gaussian input. 

Fig. fTm illustrates the performance of a (7, 5)s RSC turbo-code over a 2 x 2 quasi-static channel 
with QPSK input and either a PR or an optimized interleaver. Two different frame lengths (256 
and 2048 coded bits) are tested. With the PR interleaver, the full diversity order 4 is not achieved, 
and the performance degrades when the frame length increases, as with convolutional codes. With 
the optimized interleaver, the full diversity order is achieved and the frame error rate decreases 
when the frame length increases. The system using DNA precoding (s = 2), optimized interleaver 
and a turbo code finally performs within 1 dB from the outage capacity with Gaussian input. 

Fig. ED represents the performance of a (7, 5)s RSC turbo-code over a 4 x 1 quasi-static chan- 
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nel with BPSK input and either a PR or an optimized intcrlcavcr. Two different frame lengths 
(256 and 2048 coded bits) are tested. Without linear precoder and using a PR interleaver, the 
full diversity gain is not achieved. Asymptotically, the observed diversity is n r = 1, but, for 
low Ei,/ No, the performance is close to the performance obtained with the optimized interleaver. 
Indeed, the turbo-code generates a large amount of errors for low Eb/No and the probability of 
satisfying the ideal interleaving condition with a PR interleaver is high. However, when Eb/No is 
high, only neighbors have an influence on the error rate and it is crucial to place the few erroneous 
bits on all the channel states. This behavior is stressed with increased frame length. To achieve 
maximum diversity, according to the Singleton bound, a precoding with at least s — 2 is needed. 
This is confirmed by the simulation results and again the error rate decreases when the frame 
length increases. With the 4x1 MIMO channel, a large amount of interference exists between the 
transmit antennas. Nevertheless, performance is within 2.5 dB from the outage probability with 
Gaussian input. Performance will be even closer to the outage probability with a higher number 
of receive antennas or channel realizations. 

On Fig. performances of NRNSC codes and parallel turbo-codes with RSC constituent 
codes over a 2 x 1 quasi-static MIMO channel are drawn versus frame size for Eb/No — 15 dB. 
Performance of the Alamouti scheme pQ having same spectral efficiency without channel coding is 
also drawn as a reference. The frame error rate increases with the frame size when using Alam- 
outi scheme or NRNSC codes whereas it remains constant when using turbo codes. This strong 
property may be in part explained by the interleaving gain of the turbo-code but further research 
is required on this point. 
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8 Conclusions 

In this paper, we have analyzed the ideal behavior of an ST-BICM using full-rate linear precoding 
on a MIMO block-fading channel. Ideal performance has been derived analytically using exact 
pairwise error probabilities under ideal interleaving conditions. Using a bound on the diversity 
order, we have shown how to set the time dimension of the linear precoder. Then, we have 
presented how to design the linear precoder and the interleaver to obtain an ST-BICM achieving 
full-diversity and performing close to the ideal performance and the outage probability. Fig. 1131 
summarizes the optimization steps followed in this paper. The proposed DNA precoder slightly 
outperforms the algebraic Golden code. Furthermore, the design of DNA precoders holds for any 
parameter set (rit,s,n s ), whereas algebraic codes have to be specifically designed for each pair 
(n t ,s). We have also shown that, for a same performance, using linear precoding is less complex 
than using constellation expansion. Finally, using turbo codes with the optimized interleaver, we 
have obtained an FER which does not increase with the frame length. 

A Derivation of En k \$ LLR k {jv)\ f° r block- fading channels 

We first consider n s = 1 and extend the result to any value of n s . 

A.l Precoding matrix experiences one channel realization (n s = 1) 

For n s = 1, the quasi-static channel matrix Hfe is defined as = diag ^H^, . . . , Hj. 1 '^ , Hj. 1 ' being 
repeated s times. From Sj, we construct the sxn t matrix Sj' 1 ' = fsj 1 " 1 ' T , S^" 2 ' T , . . . , sj 1 ""' T ") . 



The row vector S\ lm of size nt denotes the i-th sub-part of S; . The n r columns of are 
independent realizations of an n t x 1 multiple-input single-output channel. Let us define Ml 1 ' as 



an n t x n t Hermitian square root matrix of Si 



E^S'psf . Thus, 




(62) 
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where 3> s [i] = diag('!?^ 1) . . . , tfj^), being the u-th real eigenvalue of S^, and U is a unitary 



matrix. We write 



E -EE TMhjsf l*sf ^ = Tr £ M^Mp (63) 



i=i i=i 2=1 



The random variable X)"=i m| ?" hjh*M^ has a Wishart distribution with n r degrees of freedom 
and parameter matrix sL 1 . The characteristic function of the trace of X)"=i ^VI^ 1 ^ h.ih*]VE^ L -' is given 
in [33]. Finally, 



P 2 7V 



(64) 



det(EW)det(EW- 1 _^) 7nt)) (65) 



u=i 



(66) 



A. 2 Precoding matrix experiences several channel realizations (n s > 1) 

For n s > 1, we first decompose each row S/ into n s sub-parts of size N t /n s , denoted S{* ] . Then, 
each sub-part sj*' is decomposed into s/n s sub-parts s[*'^ of size nt- As different values of 
t correspond to independent channel matrices H^, the characteristic functions associated with 
the sub-parts sf^ can be multiplied. Substituting s with s/n s in the mathematical development 
presented in section [A.li we directly have 

E Hk [* LLRk (j,)} = n n ( 1 -^^t ) (67) 

f=lu=l \ / 

where is the u-th eigenvalue of 

JV t s/n a 

4 t] =E^E s i t]M * s P ( 68 ) 

i=l i=l 
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Table 1: Diversity order from modified Singleton bound versus number of transmit antennas rit 
and spreading factor s, for Rq = 1/2, n r = 1 and n c = 1. 
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Table 2: Diversity order from modified Singleton bound versus number of transmit antennas rit 
and spreading factor s, for Rc — 1/2, n r = 1, n c = 2. 
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Table 3: Best gain in dB to be provided by linear precoding with respect to an unprecoded system, 
with ideal interleaving and for a given pair of codewords with Hamming distance w and BPSK 
input. 
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Figure 1: Bit-interleaved coded modulation transmitter and multiple antenna channel model. 
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Figure 2: Iterative APP detection and decoding receiver. 
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Figure 6: Optimized interleaver with rate-1/2 NRNSC codes - QPSK modulation, 2x1 MIMO 
channel, n c — 1, 10 iterations, L C N C = 1024. 
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Figure 7: Optimized interleaver with rate-1/2 (7, 5)s NRNSC code and linear precoders - QPSK, 
2x2 MIMO channel, n c = 2, 5 iterations, L C N C = 256 - No linear precoder, DNA cyclotomic 
precoder (s = 2 7 n s — 1), Golden code (s = 2,n s = 1). 
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Figure 8: Constellation expansion versus linear precoding -2x2 MIMO channel, n c = 2, L C N C = 
1024, optimized interleaver. 
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Figure 9: Optimized interleaver with rate-1/2 RSC (7, 5)s turbo-code and DNA cyclotomic pre- 
coder - 16-QAM, lxl MIMO channel, n c = 4, 15 iterations, L C N C = 2048 - Parity check bits of 
the second constituent are multiplexed via the inverse turbo interleaver. 
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Figure 10: Impact of frame size with a rate-1/2 RSC (7, 5) 8 turbo-code - QPSK, 2x2 MIMO 
channel, n c = 1, 15 iterations - Parity check bits of the second constituent are multiplexed via the 
inverse turbo interleaver. 
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Figure 11: Impact of frame length with a rate-1/2 RSC (7,5) 8 turbo-code - BPSK, 4x1 MIMO 
channel, n c — 1, 15 iterations - Parity check bits of the second constituent are multiplexed via the 
inverse turbo interleaver. 
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Figure 12: Frame error rate versus the frame size L C N C - BPSK, 2x1 MIMO channel, n c = 1, 
Eb/No = 15 dB - Alamouti STBC, NRNSC codes, Rate one half punctured parallel turbo codes - 
Parity check bits of the second constituent are multiplexed via the inverse turbo interleaver. 
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Figure 13: Summary of the space-time BICM optimization process for a n t x n r MIMO n c -block 
fading channel. The parameters s and n s are the time dimension of the sn t x sn t precoding matrix 
S and the number of independent block channel realizations linked by the precoder respectively. 



62 



